UMBC High Performance Computing Facility : Distributed Principal Direction Divisive Partitioning
This page last changed on Jun 29, 2009 by gobbert.
Jacob Kogan, Department of Mathematics and Statistics Clustering is used in a number of traditionally distant fields to describe methods for grouping of unlabeled data. Clustering very large datasets is a contemporary data mining challenge. This project concerns an application of Principal Direction Divisive Partitioning clustering algorithm (PDDP) introduced by D. Boley to a dataset residing in a number of computers connected in a network. Performance of PDDP and Distributed PDDP for datasets of moderate size will be compared. |
Document generated by Confluence on Mar 31, 2011 15:37 |